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Abstract—The nsp 14 protein, an exoribonuclease of the DEDD superfamily encoded by severe acute respira- 
tory syndrome coronavirus (SARS-CoV), was expressed in fusion with different affinity tags. The recombinant 
nsp14 proteins with either GST fusion or 6-histidine tag were shown to possess ribonuclease activity but nsp14 
with a short. MGHHHHHHGS tag sequence at the N-terminus increased the solubility of nsp14 protein and 
facilitated the protein purification. Mutations of the conserved residues of nsp14 resulted in significant attenu- 
ation but not abolishment of the ribonuclease activity. Combination of fluorescence and circular dichroism 
spectroscopy analyses showed that the conformational stability of nsp14 protein varied with many external fac- 
tors such as pH, temperature and presence of denaturing chemicals. These results provide new information on 
the structural features and would be helpful for further characterization of this functionally important protein. 
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INTRODUCTION 


Severe acute respiratory syndrome coronavirus 
(SARS-CoV) is the causative agent for the epidemic 
in year 2002 [1-3]. It is a positive-strand RNA virus 
whose gene expression and genome replication 
require polyprotein synthesis and subgenomic pro- 
duction [4-8]. Coronavirus replication requires sev- 
eral viral encoded proteins, including RNA-dependent 
RNA polymerase, RNA helicase, endoribonuclease, 
as well as a unique exoribonuclease [7, 9-10], The 
possible role of this exoribonuclease of SARS-CoV, 
nsp14, in the virus replication is reported [5, 9]. 
Although it has been proposed that nspl4 may play a 
role in the proof-reading, RNA repair and/or recombi- 
nation for maintaining the integrity of the unusually 
large RNA genome of coronaviruses [7], the actual 
biological function of nsp14 involved in viral RNA 
replication is still unknown. To gain more insight into 
the biological functions and molecular mechanisms of 
nsp14, further characterization of its biochemical and 
conformational properties is needed. 


Production of soluble recombinant proteins is vital 
for structure-function analysis. It is important to select 
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a proper vector and affinity tags for protein expression 
and purification [11]. Some tags such as maltose-bind- 
ing protein (MBP) or glutathione S-transferase (GST) 
are used for both affinity purification and solubility. 
As mentioned previously MBP and GST have long 
been used to increase the solubility of complex pro- 
teins in Escherichia coli [11-13]. The hexahistidine 
tag (His6-tag) was widely used as affinity tag, its 
small size and the robust nature of the resin purifica- 
tion contributed to its popularity [11, 14]. 


In this study, the recombinant nsp14 of SARS-CoV 
was expressed in EF. coli in three different forms and 
all of them possessed an active exoribonuclease activ- 
ity, however, the nsp14 with a short modified MGHH- 
HHHHGS tag exhibited higher solubility and facili- 
tated the purification. Mutations to the conserved res- 
idues of nsp14 significantly attenuated the enzymatic 
activity. The conformational stability of nsp14 protein 
was analyzed by fluorescence and circular dichroism 
(CD) spectroscopy after exposure to various stressed 
conditions including different concentrations of dena- 
turant, different pH and elevated temperature, and 
these biophysical analyses provided new information 
on the structural features of the SARS-CoV exoribo- 
nuclease. 
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Oligonucleotides used for PCR amplification 


Primer usage 


Primer sequence* 


HT-nsp 14 


HTL-nsp14& 
GST-nsp14 
D5992A& 
E5994A 
D6145A 


H6170A 


F: 5'-ageccatgggccatcatcatcatcatcacggatccgcagaaaatgtaactgga-3' 
R: 5'-ctgcagtcgacttactgtaacctggtaaatgt-3' 


F.: 5'-ctcgaggatccgcagaaaatgtaactgga-3' 

R.: 5'-ctgcagtcgacttactgtaacctggtaaatgt-3' 

FM: 5'-tggattggctttgctgtagcgggctgtcatgca-3' 
RM: 5'-tgcatgacageccgctacagcaaagecaatcca-3' 
FM: 5'- tttatgattgctgttcagcag-3' 

RM: 5'- ctgctgaacagcaatcataaa-3' 

FM: 5'- tggaaatgcacttgtggctagtt-3' 

RM: 5'- aactagccacaagtgcatttcca-3' 


Note: * F and R are, respectively, the forward and reverse primers for cloning into expression vector; FM and RM are, respectively, the 


forward and reverse primers used for site-directed mutagenesis. 


EXPERIMENTAL 


Construction of fusion nsp14 protein. The 
SARS-CoV cDNA was synthesized by reverse tran- 
scription using total RNAs extracted from Vero E6 
cells infected by the virus isolate WHU (GenBank 
accession number AY394850), using poly d(T) 
primer. The nsp/4 gene was PCR amplified from the 
SARS-CoV cDNA using oligonucleotides shown in 
Table 1 and ligated into pGEX-4T-| or pET-28a vector 
after digestion with BamHI and Sall. The resultant 
plasmid, pGST-nsp14, pHTL-nsp14, encoded SARS- 
CoV nsp14 with a GST or His6 tag with the linker 
sequence from the vector in the N-terminus. To 
remove the extra vector sequence, the PCR product 
was digested with Ncol and Sail and inserted into the 
Ncol and SalI sites of the vector pET-28a. The result- 
ing plasmid, pHT-nsp14, encoded nsp14 with a MGH- 
HHHHHGS sequence at the N-terminus without other 
linker sequence. 


Protein expression and purification. Plasmid 
pHT-nsp14, pHTL-nsp14 and pGST-nsp14 were 
transformed into E. coli BL21 (DE3) cells. Cultures 
were grown at 37°C until the Agog reached 0.8 and then 
induced with 0.5 mM isopropyl-1-thio—D-galactopyr- 
anoside at 20°C for 16 h. The GST fusion protein from 
pGST-nsp14 was purified using Glutathione 
Sepharose 4B resin (Amersham Biosciences) and 
His6 fusions from pHT-nsp14 and pHTL-nsp14 were 
purified by Ni-NTA agarose beads (Qiagen). Further 
purification was done by gel filtration chromatogra- 
phy over Superdex-75 (Amersham Biosciences). For 
gel filtration experiments, the column was equili- 
brated with PBS and eluted with the same buffer. The 
purified protein was confirmed by sodium dodecyl 
sulfate-polyacrylamide gel electrophoresis (SDS- 
PAGE) and mass spectrometry. The amount of protein 
was measured using BioRad protein assay kit. 


Preparation of RNA substrates. A chemically 
synthesized, hairpin RNA-encoding DNA sequence 
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(GGATCCCGCATTCTATCCTCTAGAGGATGTTC- 
AAGAGACATCCTCTAGAGGATA GAATGTTTT- 
TTGGAAAAGCTT) was ligated into the BamHI- 
Hind Il] site of pCR3.1 vector. The DNA construct was 
digested with EcoRI and transcribed by T7 RNA poly- 
merase. The RNA was purified from denaturing 7 M 
urea-8% polyacrylamide gels (19:1 of acrylamid to 
bisacrylamide) and labeled at the 5'-end with T4 poly- 
nucleotide kinase and [y-**P]ATP. After the labeling, 
the RNA was purified using G-25 column. 


Site-directed mutagenesis. Site-directed 
mutagenesis was done by splicing via overlap exten- 
sion (SOE) PCR [15]. Nucleotide sequences of the 
primers used for site-directed mutagenesis are given 
in Table 1. The sequences of all the constructs were 
confirmed by DNA sequence analysis. The correct 
plasmids encoding mutant forms of nsp14 were then 
transformed into E. coli BL21 (DE3) cells and the 
recombinant proteins were expressed and affinity 
purified as described above. The purity of the mutant 
proteins was analyzed by SDS-PAGE. 


Exonuclease assays. The standard RNA nuclease 
assay was carried out as previously described [16-19]. 
Standard reaction volume including 1 kBq of radiola- 
beled RNA substrate and 2.3 uM of nsp14 in a buffer 
consisting of 50 mM Tris-HCl (pH 8.0), 50 mM KAc, 
2 mM dithiothreitol, 10% glycerol, 0.1 mg/ml BSA, 
5 mM MgAc,, 25 mM NaH,PO,. The reaction mixture 
was incubated at 37°C for 2 min unless stated other- 
wise. Reactions were terminated by the addition of the 
equal volumes of gel-loading buffer (containing 96% 
formamide and 1 mM EDTA). The mixture was 
heated to 85°C for 2 min and put to ice bath immedi- 
ately. Products were analyzed by denaturing gels com- 
posed of 7 M urea and 8% polyacrylamide. Gels were 
dried and exposed to a Phosphorlmager screen (Amer- 
sham Biosciences). 


Circular Dichroism spectroscopy. The circular 
dichroism (CD) spectra were recorded using a Jasco 
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Fig. 1. Expression of recombinant SARS-CoV nsp14. (a) Schematic representation of the different strategies of expressing nsp14 
protein. (b) SDS-PAGE of the GST-nsp14 protein purified from E. coli cells. Lane /: molecular weight makers, Lane 2: GST-nsp14 
supernatant after sonication, Lane 3: GST-nsp14 pellet after sonication, Lane 4: protein eluted from GST affinity column. (c) SDS- 
PAGE of the HTL-nsp14 protein purified from E. coli cells. Lane 5: molecular weight makers, Lane 6: HTL-nspl4 supernatant after 
sonication, Lane 7: HTL-nsp14 pellet after sonication, Lane 8: HTL-nsp14 eluted from NTA-Ni*+ affinity column. (d) SDS-PAGE 
of the HT-nsp14 protein purified from E. coli cells. Lane 9: molecular weight makers, Lane /0: HT-nsp14 supernatant after sonica- 
tion, Lane //: HT-nsp14 pellet after sonication, Lane /2: HT-nsp14 eluted from NTA-Ni2* affinity column, Lane /3: the HT-nsp14 
purified by gel filtration chromatography on Superdex-75 column. 


J-810 spectropolarimeter in 40 mM Tris-HCl buffer 
(pH 7.2) at 20°C. A cell with a path length of 0.1 mm 
was used for far-UV CD spectrum. Each spectrum was 
the average of four scans corrected by subtracting a 
spectrum of the buffer solution in the absence of pro- 
tein recorded under identical condition. Each scan in 
the range of 200-250 nm was obtained by taking data 
points every 0.5 nm with integration time of 1s and a 
2 nm bandwidth. 


Fluorescence measurements. All fluorescence 
measurements were performed in a RF-5301PC 
speetrofluorophotometer. Slit widths with a nominal 
band pass of 3 nm were used for both excitation and 
emission beams. Intrinsic tluorescencc emission spec- 
tra were recorded from 300 to 500 nm after exciting at 
275, 280 and 295 nm. The emission intensity value at 
~343 nm was evaluated in each case from the scans. 


RESULTS AND DISCUSSION 
Expression of SARS-CoV nsp14 Protein 


To facilitate biochemical and biophysical analyses 
of SARS-CoV nsp14 protein, we first optimized the 
expression and purification conditions for this protein. 


As shown in Fig. 1, SARS-CoV nsp14 was expressed 
in E. coli with a 6XHis or GST tag fused at the N-ter- 
minus in different forms. The GST-nsp14 contained a 
large part of GST fusion while HTL-nsp14 carried a 
6xHis tag with a long vector sequence. As shown in 
Fig. 1b, GST-nsp14 protein showed 30% solubility 
when expressed at 20°C (lanes 2-3 in Fig. 1b) and it 
could be purified by GST-affinity chromatography 
(lane 4 in Fig. 1b). To obtain an intact protein without 
extra sequence, we used thrombin protease to remove 
the GST tag after purification but the protein became 
easily to aggregate in solution after losing the GST tag 
(data not shown). The HTL-nsp14 protein with His6 
tag and a long linker sequence from the vector at the 
N-terminus was produced mostly in the form of inclu- 
sion body and was poorly purified by Ni-NTA affinity 
column (lanes 6-68 in Fig. Ic). 


As both GST-nsp14 and HTL-nsp14 had problems 
with either stability or solubility, we expressed nsp14 
with a short tag (MGHHHHHHGS) in the N-terminus 
by removing all other vector sequence contained in 
HTL-nsp14, and resulted in the recombinant protein 
HT-nsp14. A significant increase in yield and solubil- 
ity (about 50%) was observed with HT-nsp14 
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Fig. 2. The exoribonuclease activity associated with the purified HT-nsp14 (a) and GST-nsp14 (b). The RNA substrate was aU AP i 
radiolabeled. The incubation time is shown above the lanes. The control RNA incubated without nsp14 is marked R. (c) Effect of 
RNaseA inhibitor on cleavage of 5'-radiolabeled RNA by nsp14. The exoribonuclease reaction was performed in the same buffer 
conditions as described in Fig. 2a, except that 400 U of RNaseA inhibitor was present. The reaction was carried out at the time points 
indicated in the horizontal axis. (d) Effect of heparin on cleavage of 5'-labeled hairpin RNA by nsp14. RNA digestion was performed 
in the absence or presence of various concentrations of heparin. Uncleaved full-length substrate RNA was quantified and shown as 


a percentage of input full-length RNA ([FL (%)]). 


(lanes /0-// in Fig. 1d). After initial purification on a 
Ni-NTA affinity column, the purity of HT-nsp14 
reached 90%, and with an additional step of gel filtra- 
tion chromatography on Superdex-75 column (Amer- 
sham, Inc), the protein was purified to near homoge- 
neity (lanes /2—/3 in Fig. Id). This tag is very small, 
which does not need removal of the tag during the fur- 
ther structural and functional studies, in fact for struc- 
tural studies, more than 60% of the proteins produced 
include a polyhistidine tag, and many examples of 
proteins crystallized with His-tag left intact are 
reported [13]. At the same time, Gly-Ser-coding 
sequences can be made to contain BamHI or Bgill 
sites and this can facilitate the cloning work. During 
the following experiments, the analyses of the confor- 
mational stability of nsp14 were performed with HT- 
nsp14. 
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The Ribonuclease Activity 
of SARS-CoV Protein nspl4 


To determine whether the different forms of the 
recombinant nsp14 protein had the ribonuclease activ- 
ity, the protein was incubated with the 5'-radiolabeled 
RNA molecules that was obtained by in vitro tran- 
scriptional system. As shown in Fig. 2a and 2b, the 
RNA substrate was digested to many small fragments 
with similar patterns by either GST-nsp14 or HT- 
nsp14. The digestion of the RNA became more exten- 
sive and the product fragments became smaller as the 
incubation time was longer. These results suggested 
that both GST-fused and 6His-tagged nsp14 proteins 
possessed a ribonuclease activity. 


To rule out the possibility that the ribonuclease 
activity was from the contaminated RNase A that was 
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Fig. 3. Mutational analysis of nsp14 exoribonuclease. (a) Sequence alignment of nsp14 with its homolog, showing the regions con- 
taining the highly conserved residues that are the putative active center amino acids. The five highly conserved residues predicted 
to be important for activity are indicated by *. Four of these five residues were mutated and their activity was tested as shown in 
panel b. (b) Exoribonuclease activity of the nsp14 mutants. Lane 1 contains only RNA substrate. The mutants and wildtype (WT) 
nsp14 are indicated in Lanes 2—5. The exoribonuclease activity assay was performed using the condition as described in Fig. 2a. 
The incubation time for the reaction was 60 min at 37°C. (c) A chart quantify the activity of the mutants as % of wildtype (WT), as 


shown in panel (b). 


co-purified, we added the RNase A inhibitor to the 
same reaction mixture. The result showed no inhibi- 
tion of the RNA digestion by RNase A inhibitor 
(Fig. 2c), suggesting that no RNase A activity was 
involved in digesting the RNA substrate. On the other 
hand, the heparin is a competitive inhibitor of nucleic 
acid binding and nucleic acid binding is a prerequisite 
for nuclease activity [20, 21]. Therefore, we tested the 
influence of heparin on the nsp14 enzymatic activity 
and the results showed that heparin could inhibit the 
exonuclease activity in concentration-dependent man- 
ner (Fig. 2d). 


Mutational Analysis of nsp14 


SARS-CoV nsp1l4 and other members of the 
DEDD superfamily proteins have a characteristic four 


acidic amino acids that are absolutely conserved 
(D5992, E5994, D6145, D6175), plus a highly con- 
served residue of H6170 (Fig. 3a) [22]. We carried out 
the point mutations of the four residues, 
D5992/E5994, D6145, and H6170, all to Ala, to 
examine the functional role of these residues. The 
results showed that each mutation had significant 
impact on the enzymatic activity and all mutants dis- 
played significantly lower ribonuclease activity than 
the wildtype protein (Fig. 3b and 3c), demonstrating 
the critical rote of these conserved residues. Nonethe- 
less, all these mutants still retained low level of exor- 
ibonuclease activity, and a complete abolishment of 
the activity may require a combination of two or more 
of these mutations. 
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Fig. 4. (a) Fluorescence emission spectral of nsp14 under different excitation. (b) Fluorescence emission spectral of nsp14 with dif- 
ferent denaturants. (c) Fluorescence spectrum of nsp14 under different concentrations of GuHCI. (d) Wavelength of maximum emis- 


sion spectrum of nsp14 under different concentration of GuHCl. 


Fluorescence Spectrum Characterization 
of nsp14 


Fluorescence emission spectra obtained for HT- 
nsp14 (Fig. 4a) are typical of those generally observed 
for Trp-containing proteins [23], despite of relatively 
high content of Tyr residues within the studied protein 
molecules. Nsp14 has similar fluorescence emission 
spectrum using different excitation spectrum and has 
maximum emission spectrum at about 343 nm. The 
fluorescence intensity is lower at 296 nm than that at 
275 and 280 nm (Fig. 4a). The maximum emission 
spectrum of the enzyme (343 nm) is blue shifted rela- 
tive to that of free L-tryptophan, which is observed to 
be at 354 nm under the same conditions, indicating 
that the tryptophan residues in nspl4 are not com- 
pletely exposed to the solvent. This shielding of the 
tryptophan residues from the solvent phase is the 
result of the folding and three-dimensional structure 
of the protein. 


Effect of Denaturant on the Stability of nsp14 


Tryptophan residues in native proteins are not 
found in identical locations, nor are they equally influ- 
enced by the environment. The microenvironment of 
every residue is characterized by a particular set of 
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physico-chemical conditions (polarizability, micro- 
viscosity, availability of charged groups, possible spe- 
cific interactions, etc.) that influence chromophore 
fluorescence. As a consequence, the protein fluores- 
cence is conditioned by the sum of fluorescent contri- 
butions of individual tryptophan residues, which vary 
over a rather wide range [24]. 


As it is observed in Fig. 4a, the maximum wave- 
length (Amax) of nsp14 tryptophan fluorescence was 
343 nm. When the protein is in the presence of 8 M 
urea and 6 M GuHCl -containing aqueous solution, 
Amax had a large red-shifting to 351 nm and 354 nm 
(Fig. 4b). This red shifting may be explained in terms 
of increased exposure of Trp residues to solvent, 
resulting from the process of protein denaturation [23, 
25]. The red shift in emission maxima was also 
accompanied by an increase in the half-maximal spec- 
tral width of the fluorescence spectra (Fig. 4b). This 
increase was largely due to the emergence and contri- 
bution of tyrosine residues to the emission spectrum 
upon denaturation and the formation of more confor- 
mationally altered species during the denaturation 
process [26]. 


The fluorescence spectra of native nsp14 showed 
maximum emission spectrum at 343 nm. Incubation 
of nsp1l4 at increasing concentrations of GuHCl 
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Fig. 5. Circular dichroism spectrum analysis of nsp14. (a) Effect of pH on the secondary structure of nsp14. (b) The thermal unfold- 
ing followed at 222 nm with a temperature range from 30 to 95°C. 


resulted in a progressive change of intrinsic fluores- 
cence emission to a longer wavelength, 354 nm 
(Fig. 4c and 4d). This is due to the fact that the Trp 
residue, which is partly buried in the native form, is 
exposed on the surface in the denatured protein. At 
GuHCl concentrations higher than 4 M the fluores- 
cence intensity measured at 343 nm drops to about 
40% of its value observed in solutions with no GuHCl 
present (Fig. 4c). This decrease of fluorescence inten- 
sity obviously results from the denaturation of protein 
and may be explained in terms of increasing polarity 
of the Trp environment as it becomes fully exposed to 
the solvent. 


Effect of pH and Temperature on the Structure 
of nsp14 


To examine the pH and thermal stability of nsp14, 
we recorded circular dichroism spectra under a chang- 
ing condition (Fig. 5). The spectrum of nsp14 at room 
temperature (22°C) displays a a-helical band with 
double negative ellipticity at 222 nm and 208 nm. At 
the same time, slightly different from the classical 
helical spectrum, the spectrum of nsp14 showed that 
the mean residue ellipticity at 208 nm was of substan- 
tially higher magnitude than that at 222 nm, which 
indicated that nsp14 is a protein mainly consists of O- 
helices and B-sheets; belong to the protein class of a+ 
B (Fig. 5a, the bottom line) [27]. 


The effect of changing pH condition on the second- 
ary structure of nsp14 is shown in Fig. 5a. By shifting 
the pH in the range pH 3 to 7.4, the a-helix is reduced 
with lower pH. The far-UV CD spectra revealed a 
clearly difference below pH 5.5, suggesting a struc- 
tural transition from a B-helix structure to a slightly 
more f-sheet structure. It has been observed that a- 
helices may undergo structural transition in acidic 
solution and at elevated temperatures [28]. 


The thermal stability of nsp14 was assessed by 
monitoring the changes in the a-helix content of the 
protein at 222 nm during the temperature range from 
30 to 95°C. In the interval from 30 to 50°C, the CD 
spectrum of nsp14 was steady. Under further heating 
from 50 to 75°C, the CD spectrum underwent a change 
to another pattern with approximately three-fold 
lower the mean residue ellipticity at 222 nm. These 
data indicated that the helical structure of nsp14 was 
disrupted by the thermal treatment. The result shown 
in Fig. 5b indicated that the Tm value of nsp14 is 
approximately 58°C. 


Protein folding is a central problem in biochemis- 
try and has continued to receive considerable atten- 
tion. Proper evaluation of protein’s stability and fac- 
tors that contribute to the conformational preference 
for native form relative to unfolded form(s) are critical 
for understanding protein folding and function. The 
results presented here provide some straightforward 
information about the structural stability of nsp14, 
which may provide the basis for further studies about 
biological functions of SARS-CoV nsp14. 
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